Search CORE

316 research outputs found

PhyloNet: a software package for analyzing and reconstructing reticulate evolutionary relationships

Author: B Moret
C Buerkle
C Linder
C Semple
C Than
C Than
C Than
Cuong Than
D Bryant
D Huson
D MacLeod
D Penny
D Posada
D Posada
D Robinson
D Ruths
D Ruths
D Ruths
Derek Ruths
DH Huson
DL Swofford
G Jin
H Ochman
H Shimodaira
I Kanj
J Felsenstein
J Felsenstein
J Mower
L Nakhleh
L Nakhleh
L Nakhleh
L Nakhleh
L Rieseberg
Luay Nakhleh
M Baroni
M Bordewich
M Hallett
M Steel
MM Morin
N Ellstrand
R Beiko
U Bergthorsson
U Bergthorsson
U Bergthorsson
V Makarenkov
Publication venue: BioMed Central
Publication date: 01/07/2008
Field of study

Abstract Background Phylogenies, i.e., the evolutionary histories of groups of taxa, play a major role in representing the interrelationships among biological entities. Many software tools for reconstructing and evaluating such phylogenies have been proposed, almost all of which assume the underlying evolutionary history to be a tree. While trees give a satisfactory first-order approximation for many families of organisms, other families exhibit evolutionary mechanisms that cannot be represented by trees. Processes such as horizontal gene transfer (HGT), hybrid speciation, and interspecific recombination, collectively referred to as <it>reticulate evolutionary events</it>, result in <it>networks</it>, rather than trees, of relationships. Various software tools have been recently developed to analyze reticulate evolutionary relationships, which include SplitsTree4, LatTrans, EEEP, HorizStory, and T-REX. Results In this paper, we report on the PhyloNet software package, which is a suite of tools for analyzing reticulate evolutionary relationships, or <it>evolutionary networks</it>, which are rooted, directed, acyclic graphs, leaf-labeled by a set of taxa. These tools can be classified into four categories: (1) evolutionary network representation: reading/writing evolutionary networks in a newly devised compact form; (2) evolutionary network characterization: analyzing evolutionary networks in terms of three basic building blocks – trees, clusters, and tripartitions; (3) evolutionary network comparison: comparing two evolutionary networks in terms of topological dissimilarities, as well as fitness to sequence evolution under a maximum parsimony criterion; and (4) evolutionary network reconstruction: reconstructing an evolutionary network from a species tree and a set of gene trees. Conclusion The software package, PhyloNet, offers an array of utilities to allow for efficient and accurate analysis of evolutionary networks. The software package will help significantly in analyzing large data sets, as well as in studying the performance of evolutionary network reconstruction methods. Further, the software package supports the proposed eNewick format for compact representation of evolutionary networks, a feature that allows for efficient interoperability of evolutionary network software tools. Currently, all utilities in PhyloNet are invoked on the command line.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

GS2: an efficiently computable measure of GO-based similarity of gene sets

Author: Ashburner
Beisvag
D. Ruths
Kanehisa
Kanehisa
L. Nakhleh
Lamb
Lein
T. Ruths
Zhang
Publication venue: Oxford University Press
Publication date
Field of study

Motivation: The growing availability of genome-scale datasets has attracted increasing attention to the development of computational methods for automated inference of functional similarities among genes and their products. One class of such methods measures the functional similarity of genes based on their distance in the Gene Ontology (GO). To measure the functional relatedness of a gene set, these measures consider every pair of genes in the set, and the average of all pairwise distances is calculated. However, as more data becomes available and gene sets used for analysis become larger, such pair-based calculation becomes prohibitive

Crossref

PubMed Central

Maximum Parsimony on Phylogenetic networks

Author: A Schrijver
AG Kluge
AWF Edwards
BM Moret
CT Nguyen
D Huson
D Sankoff
D Sankoff
G Jin
G Jin
J Hein
J Hein
JS Farris
JS Farris
L Foulds
L Nakhleh
L Nakhleh
Lavanya Kannan
W Day
Ward C Wheeler
WM Fitch
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background Phylogenetic networks are generalizations of phylogenetic trees, that are used to model evolutionary events in various contexts. Several different methods and criteria have been introduced for reconstructing phylogenetic trees. Maximum Parsimony is a character-based approach that infers a phylogenetic tree by minimizing the total number of evolutionary steps required to explain a given set of data assigned on the leaves. Exact solutions for optimizing parsimony scores on phylogenetic trees have been introduced in the past. Results In this paper, we define the parsimony score on networks as the sum of the substitution costs along all the edges of the network; and show that certain well-known algorithms that calculate the optimum parsimony score on trees, such as Sankoff and Fitch algorithms extend naturally for networks, barring conflicting assignments at the reticulate vertices. We provide heuristics for finding the optimum parsimony scores on networks. Our algorithms can be applied for any cost matrix that may contain unequal substitution costs of transforming between different characters along different edges of the network. We analyzed this for experimental data on 10 leaves or fewer with at most 2 reticulations and found that for almost all networks, the bounds returned by the heuristics matched with the exhaustively determined optimum parsimony scores. Conclusion The parsimony score we define here does not directly reflect the cost of the best tree in the network that displays the evolution of the character. However, when searching for the most parsimonious network that describes a collection of characters, it becomes necessary to add additional cost considerations to prefer simpler structures, such as trees over networks. The parsimony score on a network that we describe here takes into account the substitution costs along the additional edges incident on each reticulate vertex, in addition to the substitution costs along the other edges which are common to all the branching patterns introduced by the reticulate vertices. Thus the score contains an in-built cost for the number of reticulate vertices in the network, and would provide a criterion that is comparable among all networks. Although the problem of finding the parsimony score on the network is believed to be computationally hard to solve, heuristics such as the ones described here would be beneficial in our efforts to find a most parsimonious network.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Locating a Tree in a Phylogenetic Network in Quadratic Time

Author: BME Moret
G Cardona
IA Kanj
JM Chan
K McBreen
L Iersel van
L Nakhleh
L Parida
L Wang
P Jenkins
T Dagan
T Marcussen
TJ Treangen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/02/2015
Field of study

International audienceA fundamental problem in the study of phylogenetic networks is to determine whether or not a given phylogenetic network contains a given phylogenetic tree. We develop a quadratic-time algorithm for this problem for binary nearly-stable phylogenetic networks. We also show that the number of reticulations in a reticulation visible or nearly stable phylogenetic network is bounded from above by a function linear in the number of taxa

arXiv.org e-Print Archive

Crossref

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Circular Networks from Distorted Metrics

Author: C Daskalakis
C Semple
D Bryant
D Bryant
DH Huson
DH Huson
DH Huson
E Mossel
E Mossel
HJ Bandelt
I Gronau
J Felsenstein
L Nakhleh
M Cryan
M Steel
M Steel
MR Lacey
PL Erdös
PL Erdös
T Warnow
Publication venue
Publication date: 18/07/2017
Field of study

Trees have long been used as a graphical representation of species relationships. However complex evolutionary events, such as genetic reassortments or hybrid speciations which occur commonly in viruses, bacteria and plants, do not fit into this elementary framework. Alternatively, various network representations have been developed. Circular networks are a natural generalization of leaf-labeled trees interpreted as split systems, that is, collections of bipartitions over leaf labels corresponding to current species. Although such networks do not explicitly model specific evolutionary events of interest, their straightforward visualization and fast reconstruction have made them a popular exploratory tool to detect network-like evolution in genetic datasets. Standard reconstruction methods for circular networks, such as Neighbor-Net, rely on an associated metric on the species set. Such a metric is first estimated from DNA sequences, which leads to a key difficulty: distantly related sequences produce statistically unreliable estimates. This is problematic for Neighbor-Net as it is based on the popular tree reconstruction method Neighbor-Joining, whose sensitivity to distance estimation errors is well established theoretically. In the tree case, more robust reconstruction methods have been developed using the notion of a distorted metric, which captures the dependence of the error in the distance through a radius of accuracy. Here we design the first circular network reconstruction method based on distorted metrics. Our method is computationally efficient. Moreover, the analysis of its radius of accuracy highlights the important role played by the maximum incompatibility, a measure of the extent to which the network differs from a tree.Comment: Submitte

arXiv.org e-Print Archive

Crossref

Improved Nonrelativistic QCD for Heavy Quark Physics

Author: B. A. Thacker
B. Sheikholeslami
C. T. H. Davies
Charles Nakhleh
E. Eichten
G. P. Lepage
G. P. Lepage
G. P. Lepage
G. P. Lepage
G. Peter Lepage
K. Symanzik
K. Symanzik
Kent Hornbostel
L. L. Foldy
Lorenzo Magnea
M. H. L. Pryce
M. Lüscher
M. Lüscher
P. Weisz
S. Tani
S. Tani
Stephen Wolfram
Ulrika Magnea
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/1992
Field of study

We construct an improved version of nonrelativistic QCD for use in lattice simulations of heavy quark physics, with the goal of reducing systematic errors from all sources to below 10\%. We develop power counting rules to assess the importance of the various operators in the action and compute all leading order corrections required by relativity and finite lattice spacing. We discuss radiative corrections to tree level coupling constants, presenting a procedure that effectively resums the largest such corrections to all orders in perturbation theory. Finally, we comment on the size of nonperturbative contributions to the coupling constants.Comment: 40 pages, 2 figures (not included), in LaTe

arXiv.org e-Print Archive

Crossref

CERN Document Server

Efficient inference of bacterial strain trees from genome-scale multilocus data

Author: Baba
C. Than
Diep
Errington
H. Innan
Kubatko
Kuroda
L. Nakhleh
Maddison
Milkman
Ochman
R. Sugino
Retchless
Rokas
Saitou
Stoltzfus
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

Motivation: In bacterial evolution, inferring a strain tree, which is the evolutionary history of different strains of the same bacterium, plays a major role in analyzing and understanding the evolution of strongly isolated populations, population divergence and various evolutionary events, such as horizontal gene transfer and homologous recombination. Inferring a strain tree from multilocus data of these strains is exceptionally hard since, at this scale of evolution, processes such as homologous recombination result in a very high degree of gene tree incongruence

Crossref

Graduate University for Advanced Studies [SOKENDAI] Institutional Repository

PubMed Central

The accuracy of fast phylogenetic methods for large datasets

Author: John K. St.
Moret B. M. E.
Nakhleh L.
Roshan U.
Warnow T.
Publication venue: World Scientific Pub.
Publication date: 12/12/2006
Field of study

Infoscience - École polytechnique fédérale de Lausanne